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Recombinant process for preparing a complete malaria antigen, g P 190/MSPi 

The invention concerns a recombinant manufacturing process for the complete malaria 
anfgen g P 190/MSP1, as well as separate naturally-occurring domains and parts of the same 
by expression of a synthetic DNA sequence. The invention concerns in addition the DNA 
sequences produced by the process and the host organisms used for the expression of the 
DNA sequences. In addition the invention concerns the use of the complete malaria antigen 
as well as parts thereof as a vaccine for immunization against malaria. 

Finally the invention under consideration concerns a stabilization process 

for AT-rich genes, as well as stabilized genes which are characterized by a reduced AT 

content. 



Malaria is one of the most significant infectious diseases in the world: According to WHO 
reports, in 1990 40% of the world population in 99 countries was exposed to the risk of 
malaria. At the same time its distribution is enormously on the increase. This may be ascribed 
above all to intensive development of resistance in the parasites causing malaria, promoted 
by the recommendation and use as prophylactics of the drugs intended for treatment Besides 
the search for new and effective chemotherapeutic agents hope is nowadays directed towards 
the development of vaccines, since people in areas of the world where malaria is epidemic do 
manage to develop some kinds of immunity. As well as a natural resistance to malaria such 
as that found in heterozygous carriers of the sickle-cell gene and people with thalassaemia 
and glucose-6-phosphate dehydrogenase deficiency, in the course of malarial infection in 
humans ,mmune mechanisms can be stimulated which express themselves in a heightened 
capacity for resistance to the Plasmodia. Consequently the course of the disease in 
populations exposed to severe epidemics is generally less threatening than in persons 
exposed to the infection less frequently or for the first time. 

The main problem in the development of a vaccine is the identification of an antigen which can 
.nduce protective immunity, since there is no easily accessible well-defined animal mode, 
avertable for the four parasites affecting man. The organism causing malaria belongs to the 
Plasmodium group, of which infection with one of the four parasites Plasmodium vivax 



Plasmodium ovale, Plasmodium malaria, or Imodium faktiparum resuas from me bite of 
Anopheles mosquitoes. Of aiese parasiies Piasmodium faloiparum is .he mos. dangerous ahd 
the most widely distributed. 

The main surface protein of .he menozoite. ta invasive form of ,ha btood stage o„he malana 
paresae Piasmodium ,a,o.parum and o.her malaria parasites suoh as P. viva* is a ,90.*0 
kD g ycoprotein. Late in me deve.opmen, of ,he parasite .his precursor is processed Wo 
smalter pro,e,ns. which can however Pa iso,a.ed from meroaoi.es as a unitary complex By 
means of a glycosylphosphatidyl-inositol Pond .his comptex is coupled to the merozoite 
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Mackay. M.. Goman, M. and Scaife, ,G. ,,987). Attelic dkaon».sm h a surface antigen gene 

of the malana parasite Piasmodium falciparum. J. Mo.. Biol. 195. 273-287 Millar L H 

.he Ptasmodium fatopanum merozoite surface protein-, (MSP-1,. Mo,. Biochem. Paras J 59. 
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falciparum merozoites. J. Immunol. 134, 1946-1951; Blackman. M.J.. Heidrich. H.-G., 
Donachie, S.. McBride. J.S. and Holder. A.A. (1990), A single fragment of a malaria merozoite 
surface protein remains on the parasite during red-cell invasion and is the target of invasion- 
inhibiting antibodies. J. Exp. Med. 172, 379-382). 



Finally, a series of vaccine studies have been carried out with gp190/MSP1 from P. falciparum 
on primates, particularly on Aotus and Saimiri monkeys (see also Perrin, L.H., Merkli, B.. 
Loche. M., Chizzolini, C. Smart. J. and Richie, R. (1984), Antimalarial immunity in Saimiri 
monkeys. Immunization with surface components of asexual blood stages, J. Exp. Med. 160, 
441-451; Hall. R.. Hyde, J.E., Goman, M., Simmons. D.L. Hope. I.A., Mackay, M. and Scaife, 
J.G. (1984), Major surface antigen gene of a human malaria parasite cloned and expressed in 
bacteria. Nature 311, 379-382; Siddiqui. W.A.. Tarn, L.Q.. Kramer. K.J., Hui, G.S.N., Case, 
S.E., Yamaga. K.M.. Chang, S.P., Chan, E.B.T. and Kan, S.-C. (1987). Merozoite surface coat 
precursor protein completely protects Aotus monkeys against Plasmodium falciparum malaria. 
Proc. Natl. Acad. Sci. USA 84, 3014-3018; Ettlinger, H.M., Caspers, P.. Materile, H., 
Schoenfeld H.-J., Stueber. D. and Takacs, B. (1991), Ability of recombinant or native proteins 
to protect monkeys against heterologous challenge with Plasmodium falciparum. Inf. Imm. 59. 
3498-3503; Holder. A.A., Freeman, R.R. and Nicholls. S.C. (1988), Immunization against 
Plasmodium falciparum with recombinant polypeptides produced in Escherichia coli, Parasite 
Immunol. 10. 607-617; Herrera. S.. Herrera, M.A., Perlaza. B.L. Burki. Y.. Caspers, P., 
Doebeli. H.. Rotmann D. and Certa. U. (1990). Immunization of Aotus monkeys with 
Plasmodium falciparum blood-stage recombinant proteins. Proc. Natl, Acad. Sci. USA 87, 
4017-4021; Herrera. M.A., Rosero. F.. Herrera, S.. Caspers. P.. Rotmann. D.. Sinigaglia. F. 
and Certa, U. (1992). Protection against malaria in Aotus monkeys immunized with a 
recombinant blood-stage antigen fused to a universal T-cell epitope; correlation of serum 
gamma interferon levels with protection. Inf. Imm. 60, 154-158; Patarroyo. M.E.. Romero, P., 
Torres, M.L., Clavijo, P.. Moreno. A.. Martinez A., Rodriquez, R.. Guzmann. F. and Cabezas, 
E. (1987), Induction of protective immunity against experimental infection with malaria using 
synthetic peptides, Nature 328, 629-632). In these vaccine studies two premises may be 
distinguished: 

- Use of material isolated from parasites, and 




- Administration of material procured in heterologous systems of expression. 

The latter consists as a rule of relatively small segments of the total protein. Although the 
results of the inoculations carried out preliminarily on monkeys indicate that gp190/MSP1 
could bring about protection, all the experiments carried out on primates have two problems, 
which place such a conclusion in question: 

(a) they were carried out on too small groups of animals 

(b) they were not repeated. 

The results and the. conclusions drawn from them are consequently not statistically confirmed. 
Besides the difficulty of access to suitable monkeys there remains the main basic problem, 
that it has so far not been possible to manufacture good vaccination material in a suitable 
quantity. 

On the other hand, after the sequencing of the gp190 gene from the K1 and MAD20 strains pf 
Plasmodium falciparum overlapping fragments could be expressed in E. coli. With this 
material epidemiological studies in West Africa showed that in the adolescent group a 
correlation existed between antibody titre against gp190/MSP1 fragments on one hand and 
protection from parasite infection on the other. In addition the titre also appeared to correlate 
with the capacity to control the parasitaemia even at a low level (Tolle et al. (1993): A 
prospective study of the association between the human humoral immune response to 
Plasmodium falciparum blood stage antigen gp190 and control of malarial infections, Infect. 
Immun. 61, 40-47). These results are supplemented by new investigations on Aotus monkeys 
in the framework of the present invention. Here an enhanced protection against infection with 
the parasite was attained because protein preparations from Plasmodium falciparum, which 
consisted predominantly of unprocessed gp190/MSP1, had been used as vaccine. The 
monkeys with the highest antibody titres against gp190/MSP1 were the best protected. These 
results eventually indicated gp190 as a most promising candidate for a vaccine against 
tropical malaria. 
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By some groups of workers the C-terminal domain 
of gpl90 (pis or p42) is assigned a particular role in the 
immunity mediated by gpl 90 (see also Chang, s.P. case 
S.e.. Gosnell, W . L .. Hashimoto, A., Kramer, K.J ' Tarn ' 
L.Q., Hashiro, C.Q., Nikaido, CM. , Gibson, H.L., Lee-Ng 
C.T.. Barr, P. j. , Yokota, B.T. and Hui, G.S.N. (1996) A 
recombinant baculovirus 42-kilodalton C-terminal fragment 
of Plasmodium falciparum merozoite surface protein 1 
protects Actus monkeys against malaria. Inf. Xrm. 64, 253- 
261; Burghaus, P. A . , wellde, B.T., Hall, T. , Richards, 

l'l" t TZ: A ' F " RilSy ' E - M -' Ri ^y-^on, W. and Holder 

(1 " 6) ' In ™unization of Aotus nancymai with 
recombinant C-terminus of Plasmodiu. falciparum merozoite 
surface protein 1 in lipsomes and alum adjuvant does not 
induce protection against a challenge infection, inf. imm 
in press . 
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Thus far, however, it has also been impossible to 
exclude other parts of gpl90 on a rational basis as 
irrelevant to a protective immune response. Hence it is as 
necessary as ever to use the entire gene or the intact 
gpl90 for vaccine investigations. Despite multiple 
investigations by various work-groups, however, there has 
not yet been any success in cloning and expressing the 
entire gpl90/MSPl gene. 

Nor has it so far been possible to exclude a 
priori any part of the gpl90 sequence as irrelevant to the 
protective immune response, so that it is as necessary as 
ever to use the entire gene or gene product for vaccine 
investigations. Nevertheless, despite many investigations 
by a number of working groups there has not yet been any 
successful cloning of the whole gene for g P 190/MSPl. 

The present invention provides a means of making 
avaxlable an adequate quantity of vaccine material in the 
form of the complete g P 190/M S Pl. The present invention 
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further provides a process by which this vaccine material 
could be recovered. 

In addition the present invention provides a 
complete DNA sequence of gpl90/MSPl which could be 
expressed in a host organism. 

The present invention also provides a host 
organism containing the complete gpl90/MSPl gene. 

Finally, the present invention provides a 
stabilisation process for AT-rich genes, as well as a 
stabilised gene suitable for expression characterised in a 
reduction of the AT content. 

In the following, certain concepts are explained 
xn more detail in order to make clear how they should be 
understood in this context. 
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Throughout the description and claims of this 
specification, the word "comprise- and variations of the 
word, such as "comprising- and "comprises", means 
"including but not limited to", and is not intended to 
exclude other additives, components, integers or steps-. 

"Recombinant manufacturing process- means that a 
protein of a DNA sequence is expressed by a suitable host 
organism in which the DNA sequence has arisen from cloning 
and fusion of individual DNA fragments. 

"Complete g P 190/MSPl protein- here means the" 
entire g P 190/MSPl surface protein isolatable from the above 
named Plamodia, especially Plamodium falciparum 
representing the main surface protein. of the above named 
parasite as well as the proteins with analogous function 
from the Plasmodium species such as P. vivax. The term 
therefore comprises in each case the main surface protein 

^««^-«il«\hoc»$x l krot\K«ep\.pociN4t««-f7 .doc 21/06/00 
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of the merozoites of the four malaria parasites named above 
as dangerous to man. "Complete gpl90/MSPl gene" means the 
gene coding for this protein. In this context "complete" 
signifies that the entire amino-acid sequence of the native 
protein is present or that the gene sequence codes for the 
entire amino-acid sequence of the native protein. Mutated 
and/or shortened forms of gp!90/MSPl are however included 
therewith insofar a they display the same immunisation 
potential (vaccine protection) as the complete gpl90/MSPl. 
Finally the term also includes variants of gpl90/MSPl 
characterised by containing in one molecule protein 
fragments of various alleles. 



"FCB-1" is a strain of P. falciparum such as that 
15 described in Heidrich, H.-G., Miettinen-Baumann, A., 

Eckerskorn, C. and Lottspeich, F. (1989) The N-terminal 
amino acid sequences of the Plasmodium falciparum (FCB1) 
merozoite surface antigens of 42 and 36 kilodalton, both 
derived from the 185-195-kilodalton precursor. Mol. 
20 Biochem. Parasitol. 34, 147-154. 
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"Attachment signal" here means a protein 
structure for by a DNA sequence at the 3' or 5' end of the 
gene according to the invention. Attachment signals are 
structures enabling the attachment of a polypeptide to 
other structures, such as for example membranes. 



"Signal peptide" here signifies a protein 
structure coded for which a DNA sequence at the N-terminal 
30 end of the gene according to the invention codes. Signal 
peptides are structures which among other things enable 
penetration of the polypeptide into membranes. 



35 



In the context of the present invention W AT- 
content" means the percentage amount of adenine -thymine 
base pairs compared to guanine-cytosine base pairs. 
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"Cloning" will comprehend here all known state- 
of-the-art cloning methods which could be applied here, but 
which are nevertheless not all described in detail because 
5 they belong among the normal tools of the person skilled in 
the art. 

"Expression in an appropriate expression system" 
should here include all known state-of-the-art methods of 
10 expression in known expression systems which could be 

applied here, but which are nevertheless not all described 
in detail because they belong among the normal tools of the 
person skilled in the art. 



•••• 
» • * • 



15 The present invention provides a process by which 

the protein pgl90/MSPl and its gene can be produced in 
sufficient quantity without excessive cost. 

s 

For the first time it is possible by this process 
20 to synthesise the protein in its entirety outside the 

parasite. As the analysis with conformational epi tope- 
recognising monoclonal antibodies shows, the protein thus 
synthesised is at least reproducibly synthesisable over 
wide areas in naturally folded form. By the recombinant 
25 manufacturing process many milligrams of intact gpl90/MSPl 
could in every case be recovered from the host organism, a 
quantity which for 
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expend vaccine against malarla . Furthenwe , ^ „ lhe ™ 1 

« tang vaccines as we„ as for vaccines based on nudero acids. ' 

representative of the "K1 allele" where ki cla „He * 

' 6 K1 stands for a Particular P. falciparum strain h* 
coding sequence extends oveMS17 base pairs and lncK.de, a s, ra , sequence" 
form™, end as we,, as an attachment sequence a, theC-terminal. 

Funhemtore. according to lhe invent the recombinant manufacturtng process ls 

™ ,n - - «— ■ — p- se,ue 9 nZ;: 
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pro,e,n is based reduced reiafive to me wrtd type, foont 74% in the ohgina, gene preterab, , to 
about 55%, fo, example while the amino-acid sequence of the FCB L 

DNA sequence with the codon frequences usual in Zn " 
™*™_ "equenaes usual in the human genome is produced Other 

codon frequenaes which reduce the AT content are also conceivable. 

oTr'7 7 Unaerty ' n9 ™ * "» ~unng 

inanomer preferred embod^en, . the gene on whfch »e protein produced by the 
except for me GPI attachment signal. This embodimentis men described as 9P 190 «. 
In ye, another preferred embodiment, me gene on wTtich me protein produced by me 



In a further preferred embodiment type, the gene on which the protein produced by the 
recombinant manufacturing process is based codes for the complete amino-acid sequence 
and a trans-membrane attachment sequence. 

In a particularly preferred embodiment the recombinant manufacturing process includes the 
following steps: 

in the first place the design of the DNA sequence to be synthesized on the basis of the gene 
from P. falciparum FCB-1 , in which a DNA sequence with for example the codon frequencies 
common .n the human genome is manufactured with retention of the amino-acid sequence of 
the FCB-1 protein. 

The AT content of the gene should be reduced by this, preferably to 55%. Further on in the 
process the planned sequence is divided for example into 5 overlapping regions, which at the 
same time correspond to domains of the natural processing products of gp190/MSP1 from 
FCB-1: p83, p31, p36, p30 and p19. 

Desoxyoligonucleotides are synthesized, which in each case extend the entire length of a 
region. 

The desoxyoligonucleotides so synthesized are particularly preferred where their sequence 
corresponds in an alternating manner to the "upper (5' - 3') or the "lower (3' - 5") DNA 
strand. The length of these oligonucleotides is preferably on average 120 nucleotides and 
they overlap the neighboring sequences in each case by about 20 bases. 

In one possible embodiment DNA sequences of about double the iength of the existing end- 
products are manufactured by asymmetrical PGR, in effect so that the superfluous DNA 
sequences nearby in each case represent the opposite strand. This leads in a second 

PCR amplification cycle to a second product corresponding to the length of four originally 
-nserted oligonucleotides excluding the overlapping region. Transfer of these products to a 
preparafion consisting predominantly of single-stranded DNA by asymmetrical PCR with the 
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terminal oligonucleotides permits the manufacture in a 
further amplification step of an 800-bp long double- 
stranded DNA fragment in only 25 PCR cycles. 

In this manner the regions coding for pl9, p3 0, 
p36 and p31 are directly synthesised and molecularly cloned 
in E. coli. Clones with fault-free sequences are conserved 
either directly or by the joining up of fault-free sequence 
fragments. The region which codes for p83 is constructed 
by fusion from two sequences comprising about 1200 bp. 

In the further course of production single 
sequences are cloned. As expression vectors candidates 
preferred are the plasmids pDS56, RBS11 pHochuli, E. , 
Bannwarth, W, , Doebeli, H., Gentz, R . and Stueber, D. 
(1988) Genetic approach to facilitate purification of 
recombinant proteins with a novel metal chelate adsorbent. 
Bi^techn. 6, 1321-1325"), pBi-5 ("Baron, U. , Freundlib, S., 
Gossen, M. and Bujard, H. (1995) Corregulation of two gene 
activities by tetracycline via a bidirectional promoter. 
Nucl. Acids Res. 23, 3605-3606*) and ppTMCS. It is 
possible nonetheless also to conceive of other expression 
vectors . 

Host organisms preferred for expression are E. 
coli, with the strain DH5alphaZl specially preferred (R. 
Rutz, Dissertation 1996, Heidelberg University), HeLa 
cells, CHO cells, Toxoplasma gondii (Pf ef f erkorn, E.R. and 
Pfefferkorn, C.C. 1976, Toxoplasm gondii : Isolation and 
preliminary characterisation of temperature-sensitive 
mutants. Exp. Parasitol. 39, 3 65-376) or Leishmania. 
Additional host systems might be e.g. yeasts, baculoviruses 
or adenoviruses, so that the subject matter of the 
invention should not be limited to the host systems 
mentioned. 

The present invention also provides a complete 
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DNA sequence, suitable for expression, of the gpl90/MSPl 
surface protein of P. falciparum. 

In a preferred embodiment of the present 
invention the sequence suitable for expression codes for 
the complete amino-acid sequence. 
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In another preferred embodiment of the present invention the sequence suitable for 
expression codes for the complete amino-acid sequence except for the attachment signal. 

In a further preferred embodiment according to the present invention the DNA sequence 
suitable for expression codes for the complete amino-acid sequence except the attachment 
signal and the peptide signal. This embodiment of gp190/MSP1 can hence be characterized in 
including at the N-terminus 1 1 additional amino-acids, of which 6 are histidines. 

Particularly preferred the DNA sequence suitable for expression contains no recognizable 
"splice-donor" and "splice-acceptor" sites, and is preferably characterized in not containing 
any larger GC-rich sequences which might result in stable hairpin structures at the RNA level. 

Recognition signals for restriction enzymes which recognize sequences of six or more base 
pairs should preferably be avoided. 

In a preferred embodiment specific cleavage sites for restriction endonucleases, occurring 
only once in the gene, are introduced into regions to separate the existing domains following 
processing of the protein. 

Particularly preferred would be the presence at both ends of the gene of sequences for 
restriction endonucleases which do not occur in the gene. 

Furthermore host organisms containing the complete sequence of gp190/MSP1 surface 
protein are provided by the invention. 

Such host organisms are preferably E. coli, particularly preferred being the strain 
DH5alphaZ1, HeLa cells, CHO cells, Toxoplasma gondii or Leishmania. The HeLa and CHO 
cells ought preferably to synthesize constitutively tTA. 
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Finally the present invention provides a possibility of using a gp190/MSP1 surface protein 
created produced according to the recombinant manufacturing process, or parts thereof, for 
active immunization against malaria. 

The scheme for synthesis presented here also permits manufacture of the second allele of the 
gp190/MSP1 gene, whereby the dimorphism of the protein is also taken into account. The 
main variability of the protein depends however on the sequences of two relatively short blocs, 
blocks II and IV (ref. 1 ), which are oligomorphic. The present sequence data make it possible 
to disclose over 95% of all known gp190/MSP1 sequences with 6-8 sequence combinations of 
these blocs. The synthesis of these sequence variants can be brought about problem-free by 
means of the strategies proposed here, so that variants can be built up both in the K1 and in 
the MAD20 allele. Vaccines from the families of sequences thus created can confer protection 
where required against a wide spectrum of parasites with gp190/MSP1 variants. 

The manufacture of different types of vaccine is possible: 

- At the level of protein preparations, where in each instance mixtures of the two families (K1 
type, MAD20 type with different variants of Blocs II and IV) can come into use. Various carrier 
or adjuvant materials could be added: aluminum oxide, liposomes, IscomsQSzl , etc. 

- At the level of live vaccines: (a) viral carriers, especially vaccinia and adenoviruses; (b) 
parasites as carriers, particularly avirulent forms of Leishmania and Toxoplasma; (c) bacterial 
earners, e.g. Salmonella. 

- At the level of nucleic acids, whereby for example vectors suitable for gene therapy would 
be used to introduce the gene into the host; beyond that the introduction of nucleic acids 
coding for the desired protein can be envisaged. 

A further possibility for vaccination lies in the use of a gp190/MSP1 protein produced 
according to the recombinant manufacturing process set out by the invention, for the 
production of monoclonal antibodies which can then be used in their turn for passive 
immunization against malaria. 
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Similarly it becomes possible to use the DNA sequence on which the protein is based at an 
intermediate stage arising in the course of the recombinant manufacturing process for the 
construction of a vaccine based on nucleic acids. 

Finally the invention also concerns a process for the stabilization of gene sequences, 
especially for sequences which do not show adequate stability in expression systems. 

According to the invention this stabilization is attained because the AT content of the 
sequence is reduced. 

Moreover a stabilized gene characterized by having a reduced AT content is provided by the 
invention. An example of such a stabilized gene is the gene for gp190/MSP1 surface protein 
according to the present invention. 

In the following the invention will be described with the help of figures and tables as well as 
some examples in individual embodiments. 

They show: 

Fig. 1: Schematic representation of the gp190/MSP1 precursor 
protein from P. falciparum (FCB-1). 

Fig. 2: Two vaccine trials carried out on Aotus monkeys with 
native gp1 90/MSP1 from P. falciparum (FCB-1 ). 

Fig. 2A: With 3 x 60 micrograms gp190/MSP1 

Fig. 2B: With 3 x 40 micrograms gp190/MSP1 

Fig. 3A: Strategy of synthesis of the gp190/MSP1 gene 
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Fig. 3B: Principle of PCR-based total synthesis 

Fig. 3C: Total sequencing of gp190 s 

Fig. 3D: N- and C-termini of gp190 s1 variant 

Fig. 4A: Expression vector pDS56 with gp190 s2 sequence 

Fig. 4B: Gel electrophoresis of gp190 S2 

Fig. 5A: Expression vector pBi-5 with gp190 si sequence 

Fig. 5B: Immunofluorescence of HeLa cells 

Fig. 5C: Electrophoretic characterization of gp190 s1 
purified from HeLa cells 



Fig. 6A: Expression vector ppT 190 with gp190 sequence 

Fig. 6B: Immunofluorescence of the expression of gp190 s 
in T. gondii 

Fig. 6C: Polyacrylamide gel electrophoresis of gp190 from 
T. gondii 

In the gp190/MSP1 precursor protein from P. falciparum schematically represented in Fig. 1 
the dark blocs stand for regions which are strongly conserved in all strains. The cross-hatched 
blocs indicate the dimorphic areas, which in the case of the FCB-1 isolate derive from the K1 
allele. 01 and 02 indicate the oligomorphic areas. S denotes the peptide signal sequence 
containing 19 amino-acids, GA the C-terminal region, which includes the signal for the GPI 
attachment of the protein to the membrane. The arrows indicate the sites of the processing by 
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which the proteins p53, p31, p36 f p30 and p19 arise. The gp190 gene codes for altogether 
1639 amino-acids. 

The other figures are more conveniently explained in the context of the following Examples. 
EXAMPLES 

Example 1: total synthesis of one of the DNA sequences coding for gp19Q/MSP1 (see Fig. 3) 

A. Strategy of synthesis of the 0P190/MSP1 gene (gp190 s ) (see Fig. 3A). 

The sequence was divided into fragments corresponding to the main processing products: 
p83, p31, p36, p30 and p19. In the transition regions cleavage sites for restriction 
endonucleases (arrows in fig.3) were inserted in such a way that the amino-acid sequence 
was not altered. All the particular cleavage sites are found only once in the sequence. 

The fragments were synthesized to overlap, so that the cleavage sites at the respective ends 
made attachment by ligation to the neighboring fragment possible. All individual fragments 
contain in addition at their 5' ends a BamHI division site for insertion into expression vectors. 
The entire sequence could be cloned via MM and Clal. The scheme indicated here leads in 
addition to a sequence which cannot produce the GPI attachment since the C-terminal lacks 
18 amino-acids. Synthesis of a corresponding oligonucleotide as well as of a "primer" 
extending over the Sphl cleavage site, leads after PCR to the GA fragment which could be 
used by Sphl and Clal, the resulting total sequence being gp190 s . On removing the sequence 
coding for the peptide signal, "PCR Primer" is produced, over which the fragment AS has 
been synthesized. It is permissible to alter the N-terminal via a BamHI and a Hindlil cleavage 
site in such a way that the protein begins with amino-acid no. 20. The nuclear sequence which 
encodes gp190/MSP1 without signal sequence and without GPI attachment signal was 
designated gp190S2. Deletion of the GPI attachment signal alone leads to gp190 s1 . 
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Example 2: Expression of qd190 S2 in E. coii 

A. Expression vector (see Fig. 4A) 

The gp190 S2 sequence was inserted via the BamHI and Cial cleavage sites into pDS56RBSII, 
by means of which 6 histidines as well as some amino-acids originating in the vector were 
fused to the N-terminus. This produces the following N-terminal sequence on the reading- 
frame: Met Arg Gly Ser (His) 6 Gly Ser. Through the promoter P^siaco-i the transcription comes 
under lacR/O/IPTG control. 

B. Expression and purification of qp190 S2 (see Fig. 4D) 

Carrying over the vector pDS56RBSIIgp190 S2 into E. coli DH5alphaZ1 and induction of 
synthesis through IPTG resulted after electrophoretic separation of the total protein extract 
from the culture in a clearly visible band of the anticipated size (arrow). Purification of the 
material through I MAC and affinity chromatography (antibody column with mAK5.2) led to a 
homogeneous product of about 190 kD. In the Figure M stands for molecular weight 
standards; 1 = E. coli before; 2 = after induction with IPTG for 2 hours; 3, 4, 5 = fractions from 
elution of the mAK column. 

Example. 3: Tetracvcline-controlled expression of gp190 s1 in H eLa and CHO cells and 
isolation of the product (see also Fig. 5 and 6c) 

A. The gp190 sequence was inserted via the BamHI/Clal cleavage sites into the expression 
vector pBi-5. In this way transcription of the gene came under the control of a bidirectional 
"tTA-reponsive" promoter and could be regulated through Tc. The bidirectional promoter 
simultaneously initiated transcription of the indicator gene luciferase. In consequence the 
regulation of the expression could easily be followed (see also Fig. 5A). 
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B. Immunofluorescence of HeLa ceils, which express luciferase and qp190 s1 under Tc control 

The production of luciferase (left), gp190 s1 (middle) in the absence of Tc was demonstrated in 
MTA93-9 cells, which contain the bidirectional transcription unit of (A). Following addition of Tc 
no noteworthy synthesis of gp190S1 was shown (as repr sented in Fig. SB.right). 

C. Electrophoretic characterization of qp190 s1 purified from HeLa cells 

The HeLa cell clone HtTA93-9 as well as the CHO cell clone CH027-29 have been cultivated 
with or without Tc. Cell extracts separated by electrophoresis have been analyzed with 
mAK5.2 by means of "Western blotting" (Fig. 5C); analysis of the CHO cell line is shown on 
the left, of HeLa on the right. (1) = culture without, (2) = culture with Tc, (3) = non-transfected 
HtTA-1 cell line. Molecular weight standards are in each case indicated on the left. 

D. Purification of qp190 s1 synthesised by HeLa cell clone HtTA93-9 

Preparative cultivation of the HtTA line and induction of expression of gp190 s1 by withholding 
Tc permitted isolation of the gene product by affinity chromatography (mAK5.2 column). 

The polyacrylamide gel stained with Coomassie (Fig. 6C) following electrophoresis displayed a 
product consisting of gp190 s1 as well as another protein of about 50 kD. The latter was not 
derived from gp190 s1 and thus originated from the HeLa cells. Its projected removal should 
nevertheless present no difficulty in principle. 

Example 4: Expression of qp190 s1 in Toxoplasma gondii and purification of the product (see 
also Fig. 6). 

A. The gp190 s sequence was inserted into the vector ppT via Mlul/Pstl. This brought the gene 
under the control of the tubulin promoter (P tU b-i) of T. gondii. The 3' untranslated region (VTR) 
originated from the main surface protein of T. gondii (SAG-1). 




B. Expression nf gpl q q s j n J. gondii 

Transfection of T. gondii with pTT190 led to the isolation of parasite lines which expressed 
constitutive* gP i90 s Immunofluorescence with mAK5.2 (middle picture) showed not only 
expression of the gene but also situated the binding of the expression product close to the 
surface of the parasite, since it. like SAG-1 . provokes the same pattern of 
immunofluorescence (right section of fig. 6B). On the left in Fig. 6B a phase contrast 
photograph of the middle picture is shown. 

C. Isolation of gp190S from T. gondii 

By means of affinity chromatography (mAK5.2 column) g P 190 s was purified from a prepared 
quantity of T. gondii (5 x 10* parasites). The extremely pure protein possessed the anticipated 
molecular weight, as the Coomassie-stained polyacrylamide gel indicated following 
electrophoresis (2-3 on Figure 6C). At no. (1) on Fig. 6C purified gp190* from CHO cells is 
represented with molecular weight marked on the left side. 

Example 5: Characterization nf n , P i 9 Q s witn mn nnclonal antihrvtips 

The interaction of 16 monoclonal antibodies with gp190 s from the various heterologous 
expression systems was reviewed by immunofluorescence on P. falciparum and T. gondii or 
by "Western blot" on the purified proteins. Complete agreement was found when the two 
parasites were compared (number of + s indicates the relative intensity of the fluorescence) 
On Western blotting 12mAK's reacted with g P 190 s from E. coli and T. gondii. On the other 
hand 3 antibodies did not bind to material isolated from CHO cells. Antibodies 15 and 16 
wh.ch recognize epitopes from the oligomorphs or the alternative allele (MAD20) did not 
react. The results are summarized in Table 1. in which ND means "not carried out" 
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Example. 6: expression of qp19Q s in heterologous systems 

1. Expression in E. coli 

The gp190 S2 was inserted into the expression vector pDS56, RBSII, where it came under 
control of the promoter PN 2 siad>i. which can be controlled via the lac operator/repressor/IPTG 
system (Fig. 4A). Transfer of the plasmid into repressor-producing E. coli cells, eg E. coli 
DH5alphaZ1 , permitted expression of pg190 S2 under IPTG control. By means of a nickel- 
chelate column the raw product could be isolated via the N-terminal (His) 6 sequence 
introduced by the vector. An ensuing affinity chromatography on an antibody column led to an 
extremely pure preparation. Since the monoclonal antibodies used (mAK5.2) recognized a 
conformational epitope in the C-terminal region, this 2-step purification selected a full-length 
intact protein with correct folding at least at the C-terminus (Fig. 4B). 

In contradistinction to the natural material the end-product possesses 1 1 additional amino- 
acids at the N-terminus, of which 6 are histidines. It contains no N-terminal signal and also no 
C-terminal attachment sequence. The P. falciparum-specific sequence begins with amino-acid 
20 and ends with amino-acid 1621. 

2. Controlled expression of qp190 s1 in HeLa and CHOcell cultures 

The gp190S1 was inserted into the vector pBi-5 and thereby placed under control of a 
promoter regulable by tetracycline (Tc). The Tc-contolled system was chosen for 2 reasons: 

- It belongs to the expression systems with which the highest yield is obtained in mammalian 
cells. 

- Unsecreted foreign proteins at high concentration can interfere negatively with cell 
metabolism. Synthesis of the desired product is consequently begun only after maturation of 
the culture. 
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In the construct P Bi5-gp190 s1 a bidirectional promoter was activated by the Tc-controlled 
transcription activator and initiated transcription of both gpl 90 s ' and the luciferase indicator 
gene. In the presence of Tc the promoter is inactive. The transcription unit was transferred into 
both HeLa and CHO cells, which both synthesize constitutively tTA (HtTA line: Gossen. M. and 
Bujard. H. (1 992). Tight control of gene expression in mammalian cells by tetracycline-' 
responsive promoters. Proc. Natl. Acad. Sci. USA 89. 5547-5551; CHO-tTA line, 
unpublished). Through cotransfection (Ca 2+ -phosphate method) with a hygromycin- 
resistance-inducing marker gene was selected for successful chromosomal integration. 
Hygromycin-resistant clones were then investigated for regularity of the expression >Tc in 
which luciferase activity was used as indicator. The 9P 190 synthesis was tested in well 
regulable clones (regulation factor -Tc 1000). Immunofluorescence analysis (Fig. 5B) as well 
as investigation by "Western blot" (Fig. 5C) allowed the identification in both cell types of 
clones which synthesized gpl90 under strictly regulable conditions. The best regulable of 20 
clones were in each case subcloned. The subclones HITA93-9 and CH027-29 were used for 
cultures on a scale of 10:1. From cell extracts of these cultures intact gp190 S1 could be 
isolated by means of affinity chromatography (mAK5.2). The material was homogeneous 
except for a single cellular component which did not derive from gp190 s ' and made up 25% of 
the preparation (Fig. 6C). It had to be removed in a further purification step. 

3. Expression of g P l90S in Toxoplasma g ondii 

Like P. falciparum. Toxoplasma gondii belongs to the Apicomplexa and consequently has a 
protein modification system apparently similar to that of P. falciparum. T. gondii can be 
transfected with foreign DNA which is efficiently integrated into the genome and furthermore 
allows problem-free multiplication of T. gondii in cell culture. To obtain a product most like 
native g P 190. gpl90S2 is expressed in such a way that the protein is secreted on the surface 
of the parasite and. as in P. falciparum, bound to the membrane via a GPI analogue In that 
way the g P 190S2 (Fig. 3A) has been inserted (Fig. 6A) into the plasmid ppTMCS (D Soldati 
unpublished) and thereby placed under the control of the T. gondii tubulin promoter. 

This expression construct was transfected into T. gondii. Selection with chloramphenicol led to 
res.stant clones synthesizing gpi 90 which were detected by immunofluorescence (Fig. 6B). 
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The immunofluorescence with anti-gp190 antibodies was indistinguishable from a 
corresponding pigmentation of the parasites by means of antibodies against SAG1 , the main 
surface protein of T. gondii. It may be deduced from this that gp190 is bound to the surface of 
T. gondii. Several T. gondii clones (Nos. 3.1 to 3.4) were characterized and saved for the 
production of gp190. Using affinity chromatography (mAK5.2) gp190 was isolated from T. 
gondii cultures (clone 3.4) cultivated on a preparative scale. Electrophoretic analysis revealed 
a homogeneous product with a migration rate similar to that of the intact protein (Fig. 6C) 

Example 7: Characterization of gp190 protein from various expression systems by means of 
monoclonal antibodies. 

A set of gp190-specific monoclonal antibodies, of which a number recognize conformational 
epitopes, were used to compare the reactivity of the antibodies with P. falciparum and T. 
gondii parasites via immunofluorescence. Table 1 shows that the reactivity of the 16 
antibodies with either parasite is the same. This is a strong indication that in T. gondii "native" 
gp190 is being mostly produced. Comparison of the reactivity of the antibodies with protein 
from E.coli, HeLa or CHO cells as well as T gondii shows also that most of the antibodies react 
with the 4 preparations. In particular the protein derived from E. coli recognizes more of the 
antibodies than that produced in mammalian cells. This is apparently a consequence of 
glycosylation in mammalian cells. 

Example 8: Immunization of Aotus lemurinus qriseimembra monkeys with qp19Q/MSPfrom P. 
falciparum (FCB-1). 

Two independent immunization experiments (A, B) were carried out. In them in one instance 
(A) 1.0 mg and in the other (B) 0.6 mg of very pure gp190/MSP1 was extracted from about 2 x 
10 11 parasites respectively. 

The protein was administered together with Freund's Adjuvant (FCA). The control group 
received only FCA. Immunization equally with the protein mixture or the adjuvant was done 
three times at intervals of 4 weeks. Two weeks after the last immunization each of the animals 
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was infected with 10 s parasites (FVO strain) from a donor animal. Parasitaemia was measured 
daily. The results are summarized in Fig. 2. The symbols mean: 

T: that the animals were treated with resochin 

D: a dead animal 

Fig. 2A: individuals in the vaccinated group each received 
3 x 60 micrograms gp190/MSP1 

Fig. 2B: individuals in the vaccinated group each received 
3 x40 micrograms gp190/MSP1 

While in the control group only 1/1 1 animals did not develop parasitaemia, this was 6/10 in the 
vaccinated group. The four animals in the vaccinated group who did develop a pronounced 
parasitaemia did so - in comparison to the control group - with an average delay of four days 
(exceeding the 2% limit of parasitaemia). 

These experiments indicate for the first time a highly significant protection by gp190/MSP1 
against infection with P. falciparum in a monkey model. The process according to the invention 
consequently permits a practical vaccine against malaria to be presented for the first time. 
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THE CLAIMS DEFINING THE INVENTION ARE AS FOLLOWS: 

1. a process for preparing the complete gpl90/MSPl 
protein of Plasmodium comprising the step of expressing a 

5 gene for gp!90/MSPl protein, wherein in the DNA sequence of 
the gene has a reduced AT content in comparison to the 
naturally occurring sequence. 

2. A process according to claim 1, wherein the 
10 Plasmodium is Plasmodium falciparum. 

2. A process according to claim 1 or claim 2, 

wherein the expression takes place within a host organism. 

15 4. a process according to any one of claims 1 to 3, 

wherein the DNA sequence is derived from the DNA sequence 
of Plasmodium falciparum strain FCB-1. 

5 # a process according to any one of claims 1 to 4, 

20 wherein the AT content is reduced from 74% to 55%. 

6. A process according to any one of claims 1 to 5, 
wherein the DNA sequence encodes the complete amino acid 
sequence including signal peptide and attachment signal. 

25 

7. a process according to any one of claims 1 to 5, 
wherein the DNA sequence encodes the complete amino acid 
sequence except for the attachment signal. 

30 8. A process according to any one of claims 1 to 5, 

wherein the DNA sequence encodes the complete amino acid 
sequence except for the attachment signal and the signal 
peptide. 

35 9. a process according to any one of claims 1 to 8, 

wherein the process comprises the following steps: 

(a) Design of a DNA sequence based upon the DNA 
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sequence from Plasmodium falciparum FCB-1, wherein the DNA 
sequence takes in to consideration the degeneracy of the 
genetic code but still maintains the amino acid sequence; 

(b) Division of sequence into overlapping regions 
wherein regions consist of p83, p31ql, p36, gp30 and gpl9; 

(c) Synthesis of desoxyoligonucleotides, wherein 
each of these extend the whole length of a region; 

(d) synthesis of the regions coding for gpl9, 
p30, p3 6, p31 by PCR and synthesis of the region coding for 
p83 by fusion of two sequences comprising approximately 
1200bp; 

(e) individual cloning of coding sequences; 

(f) fusion of the complete gene; and 

(g) expression in a suitable expression system. 

10. A recombinant process according to claim 9, 
wherein the desoxyoligonucleotides synthesized in step (c) 
should be on average 120 nucleotides long and in each 
instance overlap the neighbouring sequences by about 20 
bases. 

11. A recombinant process according to any one of 
claims 1 to 8, wherein the expression vector is 
dPS56,RBSII. 

12 . A recombinant process according to any one of 
claims 1 to 8, wherein the expression vector is pBi-5. 



13 . A recombinant process according to any one of 

30 claims 1 to 8, wherein the expression vector is ppTMCS. 
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14. A recombinant process according to any one of 
claims 1 to 13, wherein the DNA sequence is expressed in E. 
coli. 

15. A recombinant process according to claim 14, 
wherein the E. coli strain used is the represser-producing 

\\taelb«filas\haneS\Bkroc\K««p\sp«c 1X48649*97. doc 21/06/00 
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strain E. coli DHSalphaZl. 

16 ■ A recombinant process according to any one of 

claims 1 to 13, wherein the DNA sequence is expressed in 
HeLa cells. 

17 ■ A recombinant process according to any one of 

claims 1 to 13, wherein the DNA sequence is expressed in 
CHO cells. 

18 • A recombinant process according to any one of 

claims 1 to 13, wherein the DNA sequence is expressed in 
Toxoplasma gondii or Leishmania. 

15 19 • A complete DNA sequence which encodes the 

gpl90/MSPl surface protein of Plasmodium, wherein the DNA 
has a reduced AT content compared to the naturally- 
bcqurring DNA sequence. 

20 20 • A complete DNA sequence according to claim 19, 

wherein the Plasmodium is Plasmodium falciparum. 

21 • A DNA sequence according to claim 19 or claim 20, 
wherein the DNA sequence is derived from the DNA sequence 

25 of Plasmodium falciparum strain FCB-1. 

22 • A DNA sequence according to any one of claims 19 
to 21, wherein the AT content is reduced from 74% to 55%. 

30 23 • A DNA sequence according to any one of claims 19 

to 22, wherein the DNA does not code for the attachment 
signal . 

24. a DNA sequence according to claim 23, wherein the 
35 DNA does not code for the signal peptide. 

25. a DNA sequence according to claim 24, wherein the 
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DNA includes a sequence which encodes 11 additional amino- 
acids, of which 6 are histidines, present at the N- 
terminus . 



5 26. A DNA sequence according to any one of claims 19 

to 25, wherein the sequence includes no recognizable 
"splice donor" and "splice acceptor" signals. 

27. A DNA sequence according to any one of claims 19 

10 to 26, wherein the sequence includes no large GC-rich 
sequences . 
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28. A DNA sequence according to any one of claims 19 

to 27, wherein the sequence includes no recognition signals 
for restriction enzymes recognizing sequences of six or 
more base pairs. 



• • • 

• 9 9 
• • 



29^ A DNA sequence according to any one of claims 19 

to 28, wherein the sequence for recognition signals of 
20 particular restriction nucleases in regions which, after 
processing of the protein, separate existing domains, 
contains uniquely occurring cleavage sites for restriction 
endonucleases . 



25 30. A DNA sequence according to any one of claims 19 

to 29, wherein the sequence has at both its ends cleavage 
sites for endonucleases which do not appear in the 
remaining sequence and in a vector for utilization. 



• • • 
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30 31.. A host organism which contains the complete DNA 

sequence according to any one of claims 19 to 30. 
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32. A host organism according to claim 31, wherein 
the organism is E. coli. 

33. A host organism according to claim 32, wherein 
the E. coli strain is the repressor-producing E. coli 
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strain DH5alphaZl. 
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34. a host organism according to claim 31, wherein 
the host organism is HeLa cells. 

5 

35. a host organism according to Claim 31, wherein 
the host organism is CHO cells. 

36. a host organism according to claim 31 or claim 
10 32, wherein the host cells synthesize constitutively MX. 

37. a host organism according to claim 31, wherein 
the host organism is selected from the group consisting of 
Toxoplasma gondii, Leishmania. baculovirus, adenovirus and 

15 yeast. 
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38. use of a gpl90/MSPl protein produced by a process 
according to any one of claims 1 to 18 for active 
immunization against malaria. 

39. use of a gpl90/MSPl protein produced by a process 
according to any one of claims 1 to 18 for producing 
monoclonal antibodies suitable for passive immunization. 

40. use of a DNA sequence produced by a process 
according to any one of claims 19 to 30 for producing a 
vaccine based on nucleic acids. 

41. A process for the stabilization of gpl90/MSPl 
gene sequence from Plasmodium, comprising the step of 
reducing the AT content of the sequence compared to the 
naturally-occurring sequence. 

42. a stabilized Plasmodium gpl90/MSPl gene having a 
reduced AT content compared to that of an unstabilized 
gene . 
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43 • A vector containing a DNA sequence according to 
any one of claims 19 to 30 and/or claim 42. 

44 • A host cell containing a vector according to 
claim 43 . 



45 • A vaccine containing a protein produced by a 

process according to any one of claims 1 to 18 and/or a DNA 
sequence according to any one of claims 19 to 3 0 and/or a 
host according to any one of claims 31 to 37 and/or a 
vector according to claim 43. 

46 • A vaccine according to claim 45, further 

comprising immunity-promoting products from Plasmodium. 

47 • A vaccine according to claim 46, wherein the 
Plasmodium is Plasmodium falciparum. 

48 • A process according to claim 1 substantially as 
hereinbefore described with reference to any one of the 
examples . 



Dated this 5th day of October 2000 
HERMANN BUJARD 
By their Patent Attorneys 
GRIFFITH HACK 

Fellows Institute of Patent and 
Trade Mark Attorneys of Australia 
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Summary 

Recombinant manufacturing process for a complete malaria antigen gp190/MSP1 

The invention concerns a recombinant manufacturing process for the complete gp!90/MSP1 
surface protein of Plasmodium, in particular P. falciparum, as well as the complete DNA 
sequence of this protein and suitable host organisms for expression of the sequence, by 
means of which the protein in its entirety can for the first time be synthesized outside the 
parasite. 

The invention opens up for the first time the possibility of manufacturing the gp190/MSP1 
surface protein in sufficient quantity; in addition there is the matter of making gp190/MSP1 
available as a vaccine. 

Finally the invention sets out a process for the stabilization of AT-rich genes (Fig. 1). 
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from FCGm" 68 °* ^ (9Pl9 ° n) a " d ° f thG s y nthetic 9 ene (9Pl90s)for gpi9 0 



AS MKIirrLCSTLrriXNTOCVTHSSrOE 27 

gpl90 n C A TT A T T A A A T A A T ACT A A 

gpUO* CSC^gCCATCAAAAJC Ai I - 1L- llwllll^KJU. 1 Hill 1 - 1 1 . ^ TCAJCAATACTCACTCCC^CACCCACSAArCC^TCACCAC 



90 



ASZ.VXXLEAlS0AVtTCTlLrOXEXMVI.MSGT 57 
9pl90 n T C A A A A AT C A ? T TT ATA AA ATA A A 

5pl90* CTGCTTAACAAACrCCAACCTTTCCAACATCC CCTCC rT ACCCSASACAC CS Z GTTe CACAACSACAAC *T GGTGCTGAAICAAGCCA C C 160 

ASSGTAVTTS TPGSXCS VaS CCSCC SVASCC S 8? 
7pl50 n A A T T T T ACTA? TCA TA CATTATCA 

9pl90* ACTCOCACSCCCCTTACAACCACCACACCCCCTTCT^^ 270 

ASVASCCSVASCGSVASGCSCNSRRTMPSD NS 117 
9pl90 ft TTATTCATT T TTCA T AT TTCAA CTA TATTA 

gpl90* CICCCCTCSSS C CGCAC^STCCCATCACCT^^ 360 

ASSDSDAXSYAOtXBRVRHYtLTXXELXYPQL 147 
gplJO n T ATTAT TTTAAA AC TCTCTA AACATTACC 

9P1S0* AGCSATTCCCACCCCAACTCCTACCCCCACC^^ 450 

ASrolTNBHtTLCDNIHCrXYtXOCYEEIHEt 177 
9pl90 ft TTTA TATT T T T A TAT AT TA 

gpl90 # TTCSACCTCAcrAATCAXATCCTCACACI GTGyS AT 540 

AS * L T K LN r» T 0 LtRA K L NOV CA HOT COX P t H L 207 
gpi90 n TATAACTTTTAT A A T A TATT T AT CT 

gp!90 a CTCTACAACTTCAATTTCTAC^TCCA^ 630 

ASXIRAMEtOVLKXtvrGTXKPtDUZXOHVCX 237 
gpl90 A ATCTATAA CTAACTC AAA AT A TTAT A A 

g?190* AACATCACAGCCAACGACTTGCACCTATTCAACAAC^ 720 

ASMSDYZ'XXNXKTXSNZNSLXEESXXTXDXNX 267 
gp!90 ft C A AA A AT A TATAT ACT C A A T T 

. gp!90* ATCCAACJgTAlirrJVAAAAGAASAACAACA^ 810 

ASSATKSSEXXXLTOA-OTDtSXYtfXQLESABM 297 
<J?190^ T A A A A A A ATATTTTTCT ATA A T 

AStXSVtEXRXDTtXXM ESIXEttOXXMSIRH 327 
qpl90 n T A A TTAAATT TT AAA C T C TATT A A 

g?190 8 CTCAICA«5TACTC<ykCAACCCCATACACA^ 990 

ASPPPANSGNTPNTLLD XHXXX EEB2XEXXE Z 357 
gpl90° CAC T ATAATTCTT A A C A A AA AT 

g?190* CCTCTCCCACCCAACTCTCCCAACXKCCTAAC^^ 1060 

ASAXTZXrHIOSLrTDPtELETYLREXMXMXO 387 
gpl»0 n T A T T T AC T A A AT A A TA AA A TT 

gpl90 9 CCCAAAACCATTAACTTCAACATAC A1 rCTCTCTT :A CTCATCTrCTTCACC7^aAGTACT^TTCA^ 1170 

AS ISAKVSTKESTSPNE TPMCVTTPLSYNOIN 417 
<7?190 n AACT ACTA TC AATTTTA T 

?p 190* ATCTC T S CCAAACT^CACACAAACGAATCAACCCAACCTAATCAATATTCCJU CI < M TC r CA CCTACC LTlTl IX. rTA CAACCATATCAAC 1260 

ASMALHELHSrCOtZMPrOYTXEPSXMZTTON 447 
9?i90 ft T TATAT TCT T T A TAT A AAC A C A T T T 

9?i90 ft AACSCTCTCAACSACCTCAATACCTTCCCTCACr^^ 1350 

ASERXXrXHEXKSXtXXSXKXX ESDKXSYSOR 47? 
gp!90 A A AACAT T A A T A A A A ATCTATC AA 

gpl90* CACACAAACAACTTTAT^AACC*^ 14 40 

ASSXStNDXTXSTSXLL MEXYOSXTHNMIDLT 507 
9pl90 n TCT CTC TT AA A AT ATTAT AC T TATTAT 

9P190* ACCAAAACTCTAAACCATATCACTAAACACTATC^^ 1530 

ASMPEKHMCXRYSYXVS XLTBSHTYASYEfS X 537 
5?190 n TA TAATATT T T A A 

<7P190 B AACTTCCACAAAATCATGCCAAAACSCTA L IC . rACAAACTCCACAAACTCACACACCATAATACC fT IT^ CATCCTATCACAATTCTAAC 1620 
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AS 

as 

9pl90 ft 
AS 

$J>190° 
gj>190* 

AS 

gp!90° 
AS 

9P190 3 
9P190* 

AS 

w i9o; 

9pl9Q* 



Y Y 



SHLCKLTKACXTKSOTSIKHZVVIKSI 

* T A A A a? ? AA T A A 7 A T A - 

^tAATCTTCACAACCTCACCAAACCrCTTAA^ 

KMLISKtSVClSTLVtHlltXOESOtrsxx: 

ATA CAA T T A AT A AT A A CT AAA 

AAGAATCTCAXAACTAACAICCAAAACGAGATCGAGACCCTTC^ 

?R0CMKPDSXIX,8VS»IVKVOV0KVtl.H»JC 

T AATTAAATC A A T A A TT AT A A 

ACAAAAGACJ^AAAIAAACCAGATCAS^LIkGATC^^ 

ZDCLXKT0J.XLXMVELX8N IB VpHSYXOSN 

C T A A TCTAATAA T C TC C A A A 

ATraTCAACTOACAACACTCAACTCATTC 

KOS'YYLXVLKXSXDXLBVrH PXVESLXilS 

a ttttatctca at? t a tca atcat a t 

atac?acc?gatcstactcaacaaacacaxagacaaacttjut at j v' j 1 1 u atgcccaaactccacagcctcatcaacgaa 



EXXNIKtEGOSDHSEPSTS 

AAA A A TAG TCAAAC 
CACAACAACAACATTAAAACTCAAGCAOUCTCACATA 



c b itcoattxp 

A A A A T A A T 
TACCACCAACCCC 




gplfO* 

AS 

OT l90 n 
9P190» 

AS 

9T>i90* 
9pl90 § 

AS 

TO 19Q a 
* W 190» 



5*190* 
*?190» 



$pl9Q n 
gpLSO* 



VPCAXAQVPTPPAPVHHXT £ H VSXLDYtSX 

* A A A CA AAA TATA TTCTATT AA 

<5t7TXAGACGCTAAACCTCAACTCCC*ACAC^^ 

LYXTLBTSTICBXYILVSa ST ttNEXXLXQ? 
T A A TT A 7 A T A T TTCTATCA A AT A AT 

CTCTATCACTTCT^CAAXACATCCTACATC^ 

XITKCSeSXLSSCDPLOLt.rH XQKKXPVHY 

ATA GAAC T AACT A T A T AT T A A T ATA 

AAGATAACCAACCAACAGCACACIAAACTCTC ^ TCT i^l C ATCCACTGGACCTCCTCTrCAATAgCCAftAACAACA ^ »l ZVI ATCTAT 

S K F 0 S LMNSC.SQS.rKEX YE XE HVCMLYXLX 

T'TA AG T A A AT AT AAA T T TT A TC 

TCTAICTTCCATAGCCTCAACAATT^^ 

OMOKXXKLLCSAXXVSTSVXTLSSSSMQPL 

TT A A TT AT A C A A A A A T AACTTCA A T A 

CACAACSACAACATTAACAAC C Al IC CACCAACCTAACAAGCrCTCCAC^TCTC rrAAAACa ^ TITCt t CC AGCTCCATGCAACCACTG 

SLTP03KPEVSAHDDTS BS TKLSKSLXLTS 

AT A GTA ATA TTTA ATTA7TG tACTT ATA A 

TC^TCACACtrTCAACACAAGCCCGAACTCAK 

ASHILSLGXKXHITOELICOX SStBfYEXItX 
<&l9Q n AT AC TAACATA TAATA ACXACX A T T A T A 

9P190* AACATCCTCTCrCTCCGOUjCAATA 

ASOSOTrYHESrTSrVXSXABOIKStHOESXX 
S?L9<^ T T T T T ATCT TATTA T TTAICT A AG 

9Pl'° GACACGGACACATTCTATAACCACACCTTCACTAACrTC 

A5XXLEEOX HXLXXTLQLS fOLYMXYXLKLEB 
gpl9<T AT A ATT AT A A A TT A GT ATCA TTTATTAT T A T A A 

TO" 0 'AACAACCTCGAAGACfiACATCAATAACCTGAAGAAGACACTCCAACT^ 

ASI.fOXXXTVCXYXHOlXXLTl.tXEOLESXtN 
9pt9C i T t T ?A * TTA A A T A ACT TATA A A AT A TCA T C T 
TPi'C CTtTTCCACAA G AACAACACACTCSSCAACTAXAACATC^ 

ASSwMHPXBVLQ 
9Pl90 n TTCACT T A A 

qplSO" TCACTCAACAATCCCAAACACCTW 

ASLEMTXILLXBYKCIVXYYHCBSSPLXTL S E 
9pl90° T A A A A AT AT C TT ATTA T A A AT A A 7 AACT A 

9Pl«Q CTGCACAACACCAAC Ar ^ r iC ' AJ UlACACTACAAACCCCrc > Xtf AI T C 1C AACACTCTCTCCGAC 

ASESrOTEOHYAStSHrXVtSXLECXtXOHlH 

AICA T. AAA TT TT A A T A AT AAC AT A A AT A TTTAT 

«pl90 GAGAGCATCCAGACCGACCATAACTACGCCACCCTCCAGAACTTCAACC 
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ASLCXKXLSYtSSCLBBLIA ELXEVIXHKNYT 
gpl90j| T A A A AT AICA TAA T T A TT A T T AT A A AAATATTA 

9P 1 • 0 CTCGACAACAACAACCrOCCTA C Cl'C TC :ACC C^ CT CCATCACCTCATCSCCCACCTCA fcCT-A M T TCATTAACAACAACAACrACACC 

*S CMSp SEMNTOVN»ALEST XXTLPECTDVAT 
9*190° T TCT TA G T T C T 7 A A A T C A TAA 

9P190 CCCAATACCeCAACCCACXJgAATACACACSTCAATAACCCACTC^^ 



ASVVSESCSOTLEQSQPJCRP AS TiVCAtSNTI 

? - **g ACA T A A AAC * A A A A A ? C A 

gpl»0 CICC1C .U lt-A>TCTGCCTCCt^CACACTC5A«»^ 

ASTTSQHVODEVDOVIIVPt PCESEEOYDOLC 
gpl90 A A A T T A A A A A ATAATCAATT TT A A 

gplSO ACCACATCTCACJUCCTCCACOlX^^ 

AS0VVTCtAVTP5v:0NILS X Z EH5YEVLYLK 
5P190 AAAAAAAA A ATT TAT T C TT A T A 

5pl90 CACGTCCTCACCCCrCACCCrCTCAC I CC riXXl^CTGATAACXTTCTCTC^^ y.TAICTGAAA 

AS7tACVYKSLXXQt,tNMVH?rHVHVX0ILNS 
^190* C~4tGCCAGCC~^TAX AACTAX « * A ? AT T T T T T A TTCA 

A|RrMXRBMfKMVLESOtZP YXDITSSHYVVX 
gpiSO A AC T A T T A AXGA T A A TT A A AAC T T A 

gp 1 9 0 1 CCCrTTAATAACACA«^AArrTCAAGAACC^ 

ASOPYXPLKKEXX*' DXFISSYHTIEDSIDTDIH 
9P190" T TATT AAA CT AACC TT T TAATC A 

9PI90 CACC CATA^AACTTCCTCAATAA A CACAACACC SATAA AT T TCT CTC 'l A ITTTACAACTATATCAAC^ACICSATCCACACCSAXATCAA? 

ASrAHOVtCrrXILitXYKS 02. osix. xyzwok 

gpl90 - A TTA T A AT ATC TAATTATA A C A 

qpl J 0 TTCCCTAATeATCTCCTCa«rATTACAACArCCTOCC 

A5QCENEKYZ.PrLHNXZT LY KTVHDXZOirVX 
<T?190" TA C CT TTACTTC TATA TTT TTTAT 

9pi90* CAAIMCCAI^ATCAAAAATATCT«=CTTCC^^ 

ASBLSAXVLHYTYEXSNVSV XI XEINYLXTIQ 
gpl9<T TTAAAATAT AT AXCA CA AAATTTA T 

gpl90* CACCT<aACCCCAACCTCCTCAACTATACTTACCA^ 

ASDXLADrXKHMHrVCZAOL ST0YMBNNL1TX 
9pl*° « T A ? TTTAAA T T CT AT A 

. gplSO CACAAX^rTCOCACArrTCAACAAAAATAACAAT^ 

ASFLSTCMVySNLAXTVLSH LI DCNLQCMLNI 
SPl9 °. __5_ YAC7 AT 77 TTT CTT ATCT TATTAT AT T A T 

qpL90 ITU.: irwlACTC^^JCCTCTTCCAA >^>C CACCCCAACCTr w TAfy;sCATCCTCAACATC 

ASSOBQCVXXQCPOHSCC P X BLOEXEECXCLL 
. A* AA ATAATCTA ATATAAA ATA T AT A 

TCCCA^^CCAATCCCT^ACAAACACTC^ r C T C XL 1 ^ 

ASKTXQECDXCVZM PBPTCB emhcccoaoaxc 
9PU<J T T ATTAT T TTC T TA TA C T 

9P190 AACTACAAACAACAAK^CATAAISCCCTCCACAACCC^^ 

AS?5Z5SCSMCKXITCEC?KPDSTP!.r0CIfC 
?pl90- A TTCA TACC TA A T T T T TT C 

qpl90* *CCCACCAACACACC««CTAACGCAAfcCy^^ 

ASSSSHrLCirPLtZLMLJLYSri-* 1629 
9pl9C* ACTTC C T A A A CA T AT A X T A AT A T T 

9PI90 TCCACCTCTAATTTCCTCCCC XaC .: C .-1 l ^li^ T-~^^^iwr~v^^Af2r^ 4940 

•top codoo Cla 1 
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